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Fusion Proteins Comprising GM-CSF and IL-3 

5 

CROSS-REFERENCE TO RELATED APPT jrATTON 

This application is a continutation-in-part of U,S. application Serial No. 397,146, filed 
August 22, 1989, now pending. 

10 BACKGROUND OF THE TNVENTTON 

The present invention relates generally to analogs of GM-CSF and IL-3 proteins, and 
particularly to the construction of fusion proteins comprising GM-CSF and 11^3. 

The differentiation and proliferation of hematopoietic cells is regulated by secreted 
glycoproteins collectively known as colony-stimulating factors (CSFs). In humans, these 

15 proteins include granulocyte-macrophage CSF (GM-CSF), which promotes granulocyte and 
macrophage production from normal bone marrow, and which also appears to regulate the 
activity of mature, differentiated granulocytes and macrophages. IL-3 (also known as multi- 
CSF) also stimulates formation of a broad range of hematopoietic cells, including granulocytes, 
macrophages, eosinophils, mast cells, megakaryocytes and erythroid cells. GM-CSF and IL-3 

20 tiius have considerable overlap in their broad range of biological activities. Other CSFs have a 
more restricted range of activity, macrophage CSF (M-CSF) stimulating almost exclusively 
macrophage colony formation, and granulocyte CSF (G-CSF) stimulating primarily 
granulocyte colonies. Altiiough GM-CSF and IL-3 have distinct amino acid sequences, 
preclinical studies indicate tfiat they may be useful to treat various cytopenias, and to potentiate 

25 immune responsiveness to infectious pathogens, and to assist in reconstituting normal blood 
cell populations following viral infection or radiation or chemotherapy-induced hematopoietic 
cell suppression. The genes encoding GM-CSF and IL-3 are located on the same chromosome 
in mouse and man and the expression of the genes is linked in some cells, such as activated T 
lymphocytes (Kelso et al., /. Immunol. 736:1718, 1986; Yang et al., Blood 7i :958, 1988; 

30 Barlow et al., EMBO J. d:617, 1987). 

Short-term experiments have demonstrated that the simultaneous combination of GM- 
CSF and IL-3 was more effective than either GM-CSF or IL-3 alone in increasing cycling rates 
and numbers of marrow hematopoietic progenitor cells in vitro in lactoferrin-treated mice 
(Broxmeyer et al., Proc. Natl Acad. ScL USA 54:3871, 1987). No such synergy has been 

35 observed in vivo for simultaneous administration of GM-CSF and IL-3, although clinical 
studies have shown that the consecutive administration of recombinant human IL-3 and 
recombinant human GM-CSF was more effective in raising white blood cell counts in normal 
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cynomolgus monkeys than either GM-CSF or 11^3 alone (Krumwieh et al., Behring Inst. Mitt. 
53:250, 1988; Donahue et al., Science 247:1820, 1988). 

The biological activities of GM-CSF and IL-3 are mediated by binding to specific cell 
surface receptors expressed on primary cells and in vitro cell lines. GM-CSF and IL-3 each 
5 bind to their Tesj)ective receptor, resulting in transduction of a biological signal to various 
immune effector cells. Recent studies of the characteristics and distribution of the receptor for 
IL-3 on tiie human myelogenous leukemia cell line KG-1 and human pre-B cell line JM-1 
indicate that a subclass of receptor exists which also binds GM-CSF (Park et al., J. Biol. 
Chem. 264:5420, 1989). These stodies showed tiiat human GM-CSF is capable of almost 

10 completely inhibiting the binding of l25i-iL-3 to KG-1 cells and, conversely, that IL-3 is 
capable of substantially inhibiting binding of ^25i.gm-CSF to the same cells. This direct 
competition between GM-CSF and IL-3 for a single cell surface receptor indicates that a single 
receptor is capable of binding both GM-CSF and IL-3. Altiiough is not yet clear whetiier tiie 
heterogeneity in IL-3 and GM-CSF binding is due to the existence of a receptor molecule 

15 which is distinct from that which binds IL-3 alone or GM-CSF alone, or whether IL-3 and 
GM-CSF receptors may exist as multisubunit complexes, composed of different ratios of IL-3 
and GM-CSF binding proteins, the receptor(s) will be referred to herein as the GM-CSF/IL-3 
receptor. 

20 SUMMARY OF THE TNVFlVTTnN 

The present invention is a fusion protein comprising GM-CSF and IL-3. The fusion 
proteins have a formula selected from the group consisting of 

R1-R2, R2-R1, R1-L-R2 and R2-L-R1 

25. 

wherein Ri is GM-CSF; R2 is IL-3; and L is a linker peptide sequence. In preferreed aspects 
of the present invention, GM-CSF and IL-3 are linked togedier via a linker sequence which 
does not interfere with the folding of either the GM-CSF or IL-3 domains. 

The fusion proteins of the present invention are more biologically active than GM-CSF 
30 or IL-3 alone or in combination and, relative to IL-3, have a significantiy higher affinity of 
binding to cell lines which have GM-CSF/IL-3 receptors compared to cell lines with only IL-3 
or GM-CSF receptors. 

BRIEF DESCRIPTION OF THE DRAWTNHS 
35 HGURE 1 is a nucleotide sequence and corresponding amino acid sequence of a 

human GM-CSF/IL-3 fusion protein. The C-terminus of human GM-CSF (amino acids 1- 
127) is linked to the N-terminus of human IL-3 (amino acids 139-271) via a linker sequence 
(amino acids 128-138). 
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FIGURE 2 is a nucleotide sequence and corresponding amino acid sequence of a 
human IL-3/GM-CSF fusion protein. The C-teraiinus of human IL-3 (amino adds 1-133) is 
linked to the N-tenninus of human GM-CSF (amino acids 149-275 via a linker sequence 
(amino acids 134-148). 

HGURE 3A-3D are graphs showing that the GM-CSF/IL-3 fusion protein (□) 
enhances BFU-E (FIG. 3A), CFU-GM (FIGS. 3B and 3C) and CFU-GEMM (HG. 3D) 
colony formation relative to IL-3 (•) or GM-CSF (O) alone, or to IL-3. and GM-CSF 
combined (A). 



10 DETAILED DESCRTPTTON O F THF. TNVFNTTniSf 

Definitions 

The term "GM-CSF" refers to proteins having amino acid sequences which are 
substantially similar to tiie native human granulocyte-macrophage colony-stimulating factor 
amino add sequences (e.g.. ATCC 53157) and which are biologically active in tiiat tiiey are 
15 capable of binding to GM-CSF receptors, transducing a biological signal initiated by binding 
GM-CSF receptors, or cross-reacting witii anti-GM-CSF antibodies raised against GM-CSF. 
Such sequences are disclosed, for example, in Anderson et al. {Proc. Nat'l. Acad. Sci. USA 
52:6250, 1985). The term "GM-CSF" also indudes analogs of GM-CSF molecules which 
exhibit at least some biological activity in common with native human GM-CSF. Exemplary 
20 analogs of GM-CSF are disclosed in EP Publ. No. 212914, which describes GM-CSF analogs 
having KEX2 protease cleavage sites inactivated so as to increase expression of GM-CSF in 
yeast hosts, and in WO Publ. No. 89/03881, which describes GM-CSF analogs having 
various glycosylation sites eliminated. Otiier GM-CSF analogs which are described herein 
may also be used to construct fusion proteins with 11^3. Furthermore, tiiose skilled in the an 
25 of mutagenesis wiU appreciate tiiat other analogs, as yet undisclosed or undiscovered, may be 
used to construct GM-CSF/IL-3 fusion protdns as described herein. A DNA sequence 
encoding a particularly preferted GM-CSF protein having potential glycosylation sites removed 
has been deposited with tiie American Type Culture Collection under accession number ATCC 
67231 (GM-CSF[Leu23Asp27Glu39]). The nomendature used herein to specify amino acid 
sequences designates amino acids that differ from the native form in brackets immediately 
following the protein name and designates the spedes with which tiie protein is associated 
immediately preceding tiie protein name. Thus. huGM-CSF[Leu23Asp27Glu29] refers to a 
human GM-CSF in which amino add 23 has been changed to a leudne residue, amino acid 27 
has been changed to an asparagine residue, and amino acid 29 has been changed to glutamic 
35 add residue. 

The term "IL-3" refers to protdns having amino acid sequences which are substantially 
similar to tiie native human Interleukin-3 amino acid sequences and which are biologically 



30 
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active in that they are capable of binding to IL-3 receptors or transducing a biological signal 
initiated by binding to IL-3 receptors, or cross-reacting with anti-IL-3 antibodies raised against 
IL-3. Such sequences are disclosed, for example, in EP Publ. Nos. 275,598 and 282,185. 
The tenn "IL-3** also includes analogs of IL-3 molecules which exhibit at least some biological 
5 activity in common with native IL-3. Exemplary analogs of IL-3 are also disclosed in EP Publ. 
No. 282,185. Particularly preferred forms of IL-3 which may be fused to GM-CSF in 
accordance with the present invention include huIL-3[Pro8Aspl5A«p70], huIL- 
3[Ser8Aspl5Asp70], and huIL-3[Ser8]. A DNA sequence aicoding another IL-3 protein 
suitable for incorporation into fusion proteins as described herein is on deposit with ATCC 

10 under accession number ATCC 67747. 

As used herein, the term "fusion protein" refers to a C-terminal to N-terminal fusion of 
GM-CSF and IL-3. The fusion proteins of the present invention include constructs in which 
the C-terminal portion of GM-CSF is fused to the N-terminal portion of IL-3, and also 
constructs in which the C-tenninal portion of IL-3 is fused to the N-terminal portion of GM- 

15 CSF. Specifically, the fusion proteins of the present invention have a formula selected firom 
the group consisting of 

R1-R2, R2-R1. R1-L-R2 and R2-L-R1 

20 wherein Ri is GM-CSF; R2 is IL-3; and L is a linker peptide sequence. GM-CSF is linked to 
IL-3 in such a manner as to produce a single protein which retains the biological activity of 
GM-CSF and IL-3. Specific fusion protein constructs are named by listing the GM-CSF and 
IL-3 domains in the fusion protein in their order of occuirence (with the N terminal domain 
specified first, followed by the C-terminal domain). Thus, GM-CSF/IL-3 refers to a fusion 

25 . protein comprising GM-CSF followed by IL-3 (i.e., the C-terminus of GM-CSF is linked to 
the N-terminus of IL-3). Unless otherwise specified, the terms GM-CSF/I1^3 and IL-3/GM- 
CSF refer to fusion proteins with a linker sequence added. Similarly, huGM- 
CSFll^u23Asp2'7Glu39]/huIL-3IPro8Asp^5Asp'70] refers to a fusion protein in which the N- 
terminal region of the fusion construct is huGM-CSF[Leu23Asp^7Glu39], and the C-terminal 

30 region is huIL-3IPro8Aspl5Asp70]. 

The term "substantially identical," when used to define either amino add or nucleic acid 
sequences, means that a particular subject sequence, for example, a mutant sequence, is 
substantially full-length and varies from the sequence of Figures 1 or 2 by one or more 
substitutions, deletions, or additions, the net effect of which is to retain biological activity of 

35 the protein when derived as a GM-CSF/IL-3 or IL-3/GM-CSF fusion protein. Alternatively, 
DNA analog sequences arc "substantially identical" to the specific DNA sequences disclosed 
herein if: (a) the DNA analog sequence is derived from substantially the entire coding regions 
of the native manmialian GM-CSF and IL-3 genes; or (b) the DNA analog sequence is 
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comparable in length with and capable of hybridization to DNA sequences of (a) under 
moderately stringent conditions and which encode biologically active GM-CSF or IL-3 
molecules; or (c) DNA sequences which are degenerate as a result of the genetic code to the 
DNA analog sequences defined in (a) or (b) and which encode biologically active GM-CSF or 
5 IL-3 molecules. Substantially identical analog proteins will be greater than about 80 percent 
similar to the corresponding sequence of the native protein. Sequences having lesser degrees 
of similarity but comparable biological activity are considered to be equivalents. In defining 
nucleic acid sequences, all subject nucleic acid sequences capable of encoding substantially 
similar amino acid sequences are considered substantially similar to a reference nucleic acid 
10 sequence. 

Percent similarity may be determined, for example, by comparing sequence inforaiation 
using the GAP computer program, version 6.0, available from die University of Wisconsin 
Genetics Computer Group (UWGCG). The GAP program utilizes the alignment method of 
Needleman and Wunsch (/. MoL Biol 48:443, 1970), as revised by Smith and Watennan 

15 {Adv, AppL Math. 2:482, 1981). Briefly, the GAP program defines similarity as the number 
of aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of die two sequences. The preferred default parameters for 
the GAP program include: (1) a unaiy comparison matrix (containing a value of 1 for identities 
and 0 for non-identities) for nucleotides, and the weighted comparison matrix of Gribskov and 

20 Burgess, NucL Acids Res, 14:61 AS, 1986, as described by Schwartz and Dayhoff, eds.. Atlas 
of Protein Sequence and Structure, National Biomedical Research Foundation, pp. 353-358, 
1979; (2) a penalty of 3.0 for each gap and an additional 0. 10 penalty for each symbol in each 
gap; and (3) no penalty for end gaps, 

"Recombinant," as used herein, means that a protein is derived from recombinant 

25 (e.g., microbial or mammalian) expression systems. "Microbial" refers to recombinant 
proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, 
"recombinant microbial" defines a protein produced in a microbial expression system which is 
essentially fi-ee of native endogenous substances. Protein expressed in most bacterial cultures, 
e.g., £. coli, will be free of glycan. Protein expressed in yeast may have a glycosylarion 

30 pattern different from that expressed in mammalian cells. 

"Biologically active," as used throughout the specification means that a particular 
molecule shares sufBcient amino acid sequence similarity witfi the embodiments of the present 
invention -disclosed herein to be capable of binding to GM-CSF leceptor, IL-3 receptor or GM- 
CSFUL'Z ceptor (see, e.g., Paric et al., /, Biol Chem. 2(54:5420, 1989), transmitting a GM- 

35 CSF and/or IL-3 stimulus to a cell, or cross-reacting witii antibodies raised against GM-CSF or 
IL-3. 
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•DNA sequence" refers to a DNA polymer, in the form of a separate fragment or as a 
component of a larger DNA construct. Preferably, the DNA sequences are in a quantity or 
concentration enabling identification, manipulation, and recovery of the sequence and its 
component nucleotide sequences by standard biochemical methods, for example, using a 
5 cloning vector. Such sequences are preferably provided in the form of an open reading frame 
uninterrupted by internal nontranslated sequences, or introns, which are typically present in 
eukaryotic genes. Genomic DNA containing the relevant sequences could also be used. 
Sequences of non-translated DNA may be present 5' or 3' from the open reading frame, where 
the same do not interfere with manipulation or expression of the coding regions. 

10 "Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. DNA 

sequences encoding the proteins provided of this invention can be assembled from cDNA 
fragments and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a 
synthetic gene which is capable of being expressed in a recombinant transcriptional unit 

"Recombinant expression vector" refers to a replicable DNA construct used either to 

15 amplify or to express DNA which encodes the fusion proteins of the present invention and 
which includes a transcriptional unit comprising an assembly of (1) a genetic element or 
elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) 
a structural or coding sequence which is transcribed into mRNA and translated into protein, 
and (3) appropriate transcription and translation initiation and termination sequences, 

20 Structural elements intended for use in yeast expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, 
where recombinant protein is expressed without a leader or transport sequence, it may include 
an N-terminal methionine residue. This residue may optionally be subsequentiy cleaved from 
the expressed recombinant protein to provide a final product. "Recombinant microbial 

25 expression system" means a substantially homogeneous monoculture of suitable host 
microorganisms, for example, bacteria such as E, coli or yeast such as 5. cerevisiaCj which 
have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit as a component of a resident plasmid. Generally, cells 
constituting the system are the progeny of a single ancestral transformant Recombinant 

30 expression systems as defined herein will express heterologous protein upon induction of the 
regulatory elements linked to the DNA sequence or synthetic gene to be expressed. 

Construction of cDNA Sequences Encoding Fusion Proteins Comprising GM-CSF and IL-S 
A DNA sequence encoding a fusion protein is constructed using recombinant DNA 
35 techniques to assemble separate DNA fragments encoding GM-CSF and IL-3 into an 
appropriate expression vector. The 3' end of a DNA fragment encoding GM-CSF is ligated to 
the 5* end of tiie DNA fragment encoding IL-3, witii the reading frames of the sequences in 
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phase to pemrit mRNA translation of the sequences into a single biologically active fusion 
protein. The resulting protein is huGM-CSF[Leu23Asp27Glu39]/huIL-3[Pro8Aspl5Asp70]. 
Alternatively, the 3' end of a DNA fragment encoding IL-3 may be ligated to the 5" end of the 
DNA fragment encoding GM-CSF, with the reading frames of the sequences in phase to 
5 permit mRNA translation of the sequences into a single biologically active fusion protein, 
yielding the protein huE.-3[Pro8Aspl5Asp70]/huGM-CSF[Leu23Asp27Glu39]. The regulatory 
elements responsible for transcription of DNA into mRNA are retained on the first of the two 
DNA sequences, while binding signals or stop codons, which would prevent read-through to 
the second DNA sequence, are eliminated. Conversely, regulatoiy elements are removed from 
10 the second DNA sequence MiMe stop codons required to end translation are retained. 

In preferred aspects of the present invention, means are provided for linking die GM- 
CSF and n--3 domains, preferably via a linker sequence. The linker sequence separates GM- 
CSF and IL-3 domains by a distance sufficient to ensure that each domain properly folds into 
its secondary and tertiary structores. Suitable linker sequences (1) will adopt a flexible 

15 extended conformation, (2) will not exhibit a propensity for developing an oidered secondary 
stracturc which could interact with the functional GM-CSF and IL-3 domains, and (3) wUl 
have minimal hydrophobic or charged character which could promote interaction with the 
functional protein domains. Typical surface amino acids in flexible protein regions include 
Gly, Asn and Ser. Virtually any permutation of amino add sequences containing Gly, Asn and 

20 Ser would be expected to satisfy die above criteria for a linker sequence. Otiier near neutral 
amino acids, such as Thr and Ala, may also be used in die linker sequence. 

The length of the linker sequence may vary widiout significantiy affecting the biological 
activity of die fusion protein. For example, die GM-CSF and IL-3 proteins may be directly 
fused witiiout a linker sequence. Linker sequences are unnecessary where die proteins being 

25 fused have non-essential N- or C-terminal amino acid regions which can be used to separate die 
functional domains and prevent steric interference. In one preferred embodiment of die present 
invention, die C-terminus of GM-CSF may be directiy fused to die N-temiinus of IL-3. GM- 
CSF has six amino acids following die C-terminal cysteine residue, which is involved in 
disulfide bonding and is essential for proper folding of the protein. 11^3 has 15 amino acids 

30 preceding its N-teiminal cysteine residue. The combined terminal regions dius may provide 
sufficient separation to render die use of a linker sequence unnecessary. 

Generally, the two protein domains will be separated by a distance approximately equal 
to die small unit dimension of GM-CSF or IL-3 (i.e., approximately 0.38 nm, as detennined 
by analogy widi similar four-helix hormones). In a preferred aspect of die invention, a linker 

35 sequence lengtii of about 1 1 amino acids is used to provide a suitable separation of functional 
protein domains, aldiough longer linker sequences may also be used. The lengdi of die linker 
sequence separating GM-CSF and IL-3 is from 1 to 500 amino acids in length, or more 
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preferably from 1 to 100 amino acids in length. In the most preferred aspects of the present 
invention, the linker sequence is from about 1-20 amino acids in length. In the specific 
embodiments disclosed herein, the linker sequence.is from about 5 to about 15 amino acids, 
and is advantageously fix}m about 10 to about 15 amino acids. Amino acid sequences useful as 
5 linkers of GM-CSF and IL-3 include, by way of example, (Gly4Ser)3 and Gly4SerGly5Ser. 

The linker sequence is incorporated into the fusion protein construct by well known 
standard methods of mutagenesis as described below. 

Proteins and Analogs 

0 The present invention provides a fusion protein comprising human GM-CSF and 

human ILr3, Derivatives of the fusion proteins of the present invention also include various 
stractural forms of the primary protein which retain biological activity. Due to the presence of 
ionizable amino and carboxyl groups, for example, a fusion protein may be in the form of 
acidic or basic salts, or may be in neutral form. Individual amino acid residues may also be 

5 modified by oxidation or reduction. 

The primary amino acid structure may be modified by fomaing covalent or aggregative 
conjugates with other chemical moieties, such as glycosyl groups, lipids, phosphate, acetyl 
groups and the like, or by creating amino acid sequence mutants. Covalent derivatives are 
prepared by linking particular functional groups to amino add side chains or at the N- or C- 

0 terminL Other derivatives of the frision protein within the scope of this invention include 
covalent or aggregative conjugates of the fusion protein with other proteins or polypeptides, 
such as by synthesis in recombinant culture as N- or C-terminal fusions. For example, the 
conjugated peptide may be a signal (or leader) polypeptide sequence at the N-taminal region of 
the protein which co-translationally or post-translationally directs transfer of the protein from 

5 its site of synthesis to its site of function inside or outside of the cell membrane or wall (e.g., 
the yeast a-factor leader). Peptides may also be added to facilitate purification or identification 
of GM-CSF/EL-3 fusion proteins (e.g., poly-His). The amino acid sequence of the fusion 
protein can also be linked to the peptide Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (DYKDDDDK) 
(Hopp et al., Bio/Technology 6:1204, 1988.) The latter sequence is highly antigenic and 

0 provides an epitope reversibly bound by a specific monoclonal antibody, enabling rapid assay 
and facile purification of expressed recombinant protein. This sequence is also specifically 
cleaved by bovine mucosal enterokinase at the residue immediately following the Asp-Lys 
pairing. Fusion proteins capped with this peptide may also be resistant to intracellular 
degradation in E. coIL 

5 Fusion protein derivatives may also be used as immunogens, reagents in receptor-based 

immunoassays, or as binding agents for affinity purification procedures of binding ligands. 
Derivatives may also be obtained by cross-linking agents, such as M-maleimidobenzoyl 
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succinimide ester and N-hydroxysuccinimide, at cysteine and lysine residues. Fusion proteins 
may also be covalenfly bound through reactive side groups to various insoluble substrates, 
such as cyanogen bromide-activated, bisoxirane-activated, carbonyldiimidazole-activated or 
tosyl-activated agarose structures, or by adsorbing to polyolefin surfaces (with or without 
5 glutaraldehyde cross-linking). 

The present invention also includes proteins with or without associated native-pattern 
gjycosylation. Expression of DNAs encoding the fusion proteins in bacteria such as £. coli 
provides non-glycosylated molecules. Functional mutant analogs having inactivated N- 
glycosylation sites can be produced by oligonucleotide synthesis and ligation or by site-specific 

10 mutagenesis techniques. These analog proteins can be produced in a homogeneous, reduced- 
carbohydrate form in good yield using yeast expression systems. N-glycosylation sites in 
eukaiyotic proteins are characterized by the amino acid triplet Asn-Ai-Z, where Ai is any 
amino acid except Pro, and Z is Ser or Thr. In tiiis sequence, asparagine provides a side chain 
amino group for covalent attachment of carbohydrate. Such a site can be eliminated by 

15 substituting anotiier amino acid for Asn or for residue Z, deleting Asn or Z, or inserting a non- 
Z amino acid between Ai and Z, or an amino acid otfier tiian Asn between Asn and Ai. 
Examples of human GM-CSF analogs in which glycosylation sites have been removed include 
huGM-CSF[Leu23\sp27Glu39], huGM-CSF[Leu23], huGM-CSF[Leu23Asp27], huGM- 
CSF[Glu39], huGM-CSF[Asp27Glu39], huGM-CSF[Leu23Glu39] and huGM-CSF[Asp27]. 

20 Examples of human IL-3 analogs in which glycosylation sites have been removed include 
huIL-3[Pro8Aspl5Asp70], huIL-3[Asp70], huIL-3[Aspl5Asp70]. huIL-3[Pro8Aspl5], huIL- 
3[Pro8Asp70], and huIL-3[Aspl5]. 

Derivatives and analogs may also be obtained by mutations of tiie fusion protein. A 
derivative or analog, as referred to herein, is a polypeptide in which die GM-CSF and IL-3 

25 domains are substantially homologous to full-length GM-CSF and IL-3 domains of tiie 
sequences disclosed in Figures 1 and 2 but which has an amino acid sequence difference 
attributable to a deletion, insertion or substitution. 

Bioequivalent analogs of fusion proteins may be constructed by, for example, making 
various substimtions of residues or sequences. For example, cysteine residues can be deleted 

30 or replaced witfi otiier amino acids to prevent formation of incorrect intramolecular disulfide 
bridges upon renaturation. Odier approaches to mutagenesis involve modification of adjacent 
dibasic amino acid residues to enhance expression in yeast systems in which KEX2 protease 
activity is present. Generally, substitutions should be made conservatively; i.e., die most 
preferred substitute amino acids are Uiose having physicochemical characteristics resembling 

35 tiiose of the residue to be replaced. Similarly, when a deletion or insertion strategy is adopted, 
the potential effect of die deletion or insertion on biological activity should be considered. 
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Mutations in nucleotide sequences constructed for expression of analogs must, of 
course, preserve the reading frame phase of the coding sequences and preferably will not create 
complementary regions that could hybridize to produce secondary mRN A stmctures such as 
loops or hairpins which would adversely affect translation of the GM-CSF/IL-3 receptor 
5 mRNA. Although a mutation site may be predetermined, it is not necessary that the nature of 
the mutation per se be predetermined. For example, in order to select for optimum 
characteristics of mutants at a given site, random mutagenesis may be conducted at the target 
codon and the expressed mutants screened for the desired activity. 

Not all mutations in nucleotide sequences which encode fusion proteins comprising 

10 GM-CSF and IL-3 will be expressed in the final product, for example, nucleotide substitutions 
may be made to enhance expression, primarily to avoid secondary structure loops in the 
transcribed mRNA (see EPA 75,444A, incorporated herein by reference), or to provide codons 
that are more readily translated by the selected host, e.g., the well-known £, coU preference 
codons for E. coli expression. 

15 Mutations can be introduced at particular loci by synthesizing oligonucleotides 

containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of the 
native sequence. Following ligation, the resulting reconstructed sequence encodes an analog 
having the desired amino acid insertion, substimtion, or deletion. 

Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can be 

20 employed to provide an altered gene having particular codons altered according to the 
substitution, deletion, or insertion required. Exemplary methods of making the alt^ations set 
forth above are disclosed by Walder et al. {Gene 42:133, 1986); Bauer et al, {Gene 37\1X 
1985); Craik {BioTechniques, January 1985, 12-19); Smith et al. {Genetic Engineering: 
Principles and Methods, Plenum Press, 1981); and U.S. Patent Nos, 4,518,584 and 

25 4,737,462, and are incorporated by reference herein. 

Expression of Recombinant Fusion Proteins Comprising GM-CSF and IL-3 

The present invention provides recombinant expression vectors which include 
synthetic or cDNA-derived DNA fi^gments encoding human fusion proteins comprising GM- 

30 CSF and IL-3 or bioequivalent analogs operably linked to suitable transcriptional or 
translational regulatory elements derived fi*om mammalian, microbial, viral or insect genes. 
Such regulatory elements include a transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA ribosomal binding sites, and 
sequences which control the termination of transcription and translation, as described in detail 

35 below. The ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transforaiants may additionally be incorporated. DNA 
regions are operably linked when they are functionally related to each other. For example. 
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DNA for a signal peptide (secretory leader) is operably linked to DNA for a polypeptide if it is 
expressed as a precursor which participates in the secretion of the polypeptide; a promoter is 
operably linked to a coding sequence if it controls the transcription of the sequence; or a 
ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit 
5 translation. Generally, operably linked means contiguous and, in the case of secretory leaders, 
contiguous and in reading frame. 

Due to code degeneracy, there can be considerable variation in nucleotide sequences 
encoding the same amino acid sequence; exemplary DNA embodiments are those 
corresponding to the nucleotide sequences shown in Figures 1 or 2. Other embodiments 

10 include sequences commensurate in length with and capable of hybridizing to the sequences of 
Figures 1 or 2 under moderately stringent conditions (SO^C, 2 X SSC) and which encode 
biologically active fusion proteins. 

Transformed host cells are cells which have been transformed or transfected with 
fusion protein vectors constructed using recombinant DNA techniques. Transformed host cells 

15 ordinarily express the desired fusion protein, but host cells transformed for purposes of 
cloning or amplifying DNA do not need to express the protein. Expressed fusion protein will 
generally be secreted into the culture supernatant. Suitable host cells for ejqiression of fusion 
protein include prokaryotes, yeast or higher eukaryotic cells under the control of appropriate 
pronooters. Prokaryotes include gram negative or gram positive organisms, for exaniple E. coli 

20 or bacilli. Higher eukaryotic cells include established cell lines of mammalian origin as 
described below. Cell-free translation systems could also be employed to produce fusion 
protein using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular 
hosts are described by Pouwels et al. {Cloning Vectors: A Laboratory Manual, Elsevier, New 

25 York, 1985), the relevant disclosure of which is hereby incorporated by reference. 

Prokaryotic expression hosts may be used for expression of fusion protein that do not 
require extensive proteolytic and disulfide processing. Prokaryotic expression vectors genwally 
comprise one or more phenotypic selectable markers, for example a gene encoding proteins 
conferring antibiotic resistance or supplying an autotrophic requirement, and an origin of 

30 replication recognized by the host to ensure amplification within the host. Suitable prokaryotic 
hosts for transformation include E. coli, Bacillus subtilis. Salmonella typfumurium, and 
various species witiiin the genera Pseudomonas, Streptomyces, and Staphyolococcus, 
although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable marker and 

35 bacterial origin of replication derived from commercially available plasmids comprising genetic 
elements of the well-known cloning vector pBR322 (ATCC 37017). Such commercial vectors 
include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEMl 
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(Promega Biotec, Madison, WI, USA). These pBR322 "backbone" sections are combined 
with an appropriate promoter and the structural sequence to be expressed. E. coli is typically 
transformed using derivatives of pBR322, a plasmid derived ftom an E. coli species (Bolivar et 
al., Gene 2:95, 1977). pBR322 contains genes for ampicillin and tetracycline resistance and 
5 thus provides siinple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include the b- 
lactamase (penicillinase) and lactose promoter system (Chang et al.. Nature 275:615, 1978; and 
Goeddel et al.. Nature 257:544, 1979), the tryptophan (trp) promoter system (Goeddel et al,, 
NucL Acids Res. 5:4057, 1980; and EPA 36,776) and tac promoter (Maniatis, Molecular 

10 Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412, 1982). A particularly 
useful bacterial expression system employs the phage X Pl promoter and cI857ts 
thermoinducible repressor. Plasmid vectors available from the American Type Culture 
Collection which incorporate derivatives of the X Pl promoter include plasmid pHLFB2, 
resident in E. coli strain JMB9 (ATCC 37092) and pPLc28, resident in £. coli RRl (ATCC 

15 53082). 

Recombinant fusion proteins may also be expressed in yeast hosts, preferably from the 
Saccharomyces species, such as 5. cerevisiae. Yeast of other genera such as Pichia or 
Kluyveromyces may also be employed. Yeast vectors will generally contain an origin of 
replication from the 2m yeast plasmid or an autonomously replicating sequence (ARS), 

20 promoter, DNA encoding the fusion protein, sequences for polyadenylation and transcription 
termination and a selection gene. Preferably, yeast vectors will include an origin of rq)lication 
and selectable marker permitting transformation of both yeast and £. coli, e.g., the ampicillin 
resistance gene of £. coli and S. cerevisiae trpl gene, which provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived finom a 

25 highly expressed yeast gene to induce transcription of a structural sequence downstream. The 
presence of the trpl lesion in the yeast host cell genome then provides an effective environment 
for detecting transformation by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for metallothionein, 
3-phosphoglycerate kinase (Hitzeman et al., /. BioL Chem. 255:2073, 1980) or other 

30 glycolytic enzymes (Hess et al., /. Arfv. Enzyme Reg. 7:149, 1968; and Holland et al., 
Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. Suitable vectors and promoters for use in yeast expression are 

35 further described in R. Hitzeman et al., EPA 73,657. 

Preferred yeast vectors can be assembled using DNA sequences from pBR322 for 
selection and replication in £. coli (Amp^ gene and origin of replication) and yeast DNA 
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sequences including a gjucose-repressible ADH2 promoter and a-factor secretion leader. The 
ADH2 promoter has been described by RusseU et al. (/. Bhl. Chem. 258:2614, 1982) and 
Beier et al. (Nature 300:724, 1982). The yeast a-factor leader, which directs secretion of 
heterologous proteins, can be inserted between the promoter and the structural gene to be 
5 expressed. See, e.g.. Kurjan et al.. Cell 30:933, mi; and Bitter et al., Proc. Natl. Acad. Sci. 
USA Si:5330, 1984. The leader sequence may be modified to contain, near its 3' end, one or 
more useful restriction sites to facilitate fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art; an 
exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 75:1929, 
10 1978, selecting for Trp+ transformants in a selective medium consisting of 0.67% yeast 
nitrogen base, 0.5% casamino acids, 2% glucose, 10 ug/ml adenine and 20 ^ig/ml uiacU. 

Host strains transformed by vectors comprising the ADH2 promoter may be grown for 
expression in a rich medium consisting of 1% yeast extract, 2% peptone, and 1% glucose 
supplemented with 80 ^lg/ml adenine and 80 ng/ml uracil. Derepression of the ADH2 promoter 
15 occurs upon exhaustion of medium glucose. Crude yeast supematants are harvested by 
filtration and held at 4*'C prior to further purification. 

Various mammalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for production of heterologous proteins in insect 
ceUs are reviewed by Luckow and Summers, Bio/Technology 6:41 (1988). Examples of 
20 suitable mammalian host cell Hnes include the COS-7 lines of monkey kidney cells, described 
by Gluzman (Cell 23:115, 1981). and other cell lines capable of expressing an appropriate 
vector including, for example, L cells, C127, 3T3, Chinese hamster ovaiy (CHO), HeLa and 
BHK cell lines. Mammalian expression vectors may comprise non-transcribed elements such 
as an origin of replication, a suitable promoter and enhancer Unked to the gene to be expressed, 
25 and other 5' or 3' flanking nontranscribed sequences, and 5' or 3' nontranslated sequences, 
such as necessary ribosome binding sites, a poly-adenylation site, splice donor and acceptor 
sites, and transcriptional termination sequences. 

The transcriptional and iranslational control sequences in expression vectors to be used 
in transforming vertebrate cells may be provided by viral sources. For example, commonly 
30 used promoters and enhancers are derived from Polyoma, Adenovirus 2, Simian Virus 40 
(SV40), and human cytomegalovirus. DNA sequences derived fmm the SV40 viral genome, 
for example, S V40 origin, eariy and late promoter, enhancer, splice, and polyadenylation sites 
may be used to provide the otiier genetic elements required for expression of a heterologous 
DNA sequence. The early and late promoters are particularly useful because both are obtained 
35 easily fiiom tiie virus as a fiagment which also contains the SV40 viral origin of replication 
(Fiers et al.. Nature 275:113, 1978). Smaller or larger SV40 fragments may also be used, 
provided tiie approximately 250 bp sequence extending from the Hind IH site toward die BgR 
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site located in the viral origin of replication is included Exemplary vectors can be constructed 
as disclosed by Okayama and Berg (MoL Cell Biol 3:280, 1983). 

A useful system for stable high level expression of mammalian receptor cDNAs in 
C127 murine mammary epithelial cells can be constructed substantially as described by 
5 Cosman et aL (MoL Immunol 25:935, 1986). 

Particularly prefeired eukaryotic vectors for expression of GM-CSF/IL-3 DNA include 
pIXY321 and pIXY344, both of which are yeast expression vectors derived from 
pBC102.K22 (ATCC 67,255) and contain DNA sequences fix>m pBR322 for selection and 
replication in E, coli (Apr gene and origin of replication) and yeast, as described below in 

10 Examples 1 and 7. 

Purified mammalian fusion proteins or analogs are prepared by culturing suitable 
hosWector systOTis to e3q)ress the recombinant translation products of die DNAs of the present 
invention, which are then purified from culture media or cell extracts. 

For example, supematants from systems which secrete recombinant protein into 

15 culture media can be first concentrated using a commercially available protein concentration 
filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit Following the 
concentration step, the concentrate can be applied to a suitable purification matrix. For 
example, a suitable affinity matrix can comprise a GM-CSF or IL-3 receptor or lectin or 
antibody molecule bound to a suitable support Alternatively, an anion exchange resin can be 

20 employed, for example, a matrix or substrate having pendant diethylaminoethyl (DEAE) 
groups. The matrices can be acrylamide, agarose, dextran, cellulose or other types commonly 
employed in protein purification. Alternatively, a cation exchange step can be employed. 
Suitable cation exchangers include various insoluble matrices comprising sulfopropyl or 
carboxymethyl groups. Sulfopropyl groups are preferred. 

25 Finally, one or more reversed-phase high performance liquid chromatography (RP- 

HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl 
or other aliphatic groups, can be employed to further purify a fusion protein composition. 
Some or all of the foregoing purification steps, in various combinations, can also be employed 
to provide a homogeneous recombinant protein. 

30 Recombinant protein produced in bacterial culture is usually isolated by initial extraction 

from cell pellets, followed by one or more concentration, salting-out, aqueous ion exchange or 
size exclusion chromatography steps. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. Microbial cells employed in expression 
of recombinant fusion proteins can be disrupted by any convenient method, including freeze- 

35 thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 

Fermentation of yeast which express fusion proteins as a secreted protein greatly 
simplifies purification. Secreted recombinant protein resulting from a large-scale fermentation 
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can be purified by methods analogous to those disclosed by Urdal et al. (7. Chromatog. 
296:171, 1984), This reference describes two sequential, reversed-phase HPLC steps for 
purification of recombinant murine GM-CSF on a pipparative HPLC column. 

Fusion protein syntiiesized in recombinant culture is characterized by die presence of 
5 non-human cell components, including proteins, in amounts and of a character which depend 
upon the purification steps taken to recover the fusion protein from the culture. These 
components ordinarily wiU be of yeast, prokaryotic or non-human higher eukaiyoric origin and 
preferably are present in innocuous contaminant quantities, on tiie oider of less tiian about 5 
percent by scanning densitometry or chromatography. Further, recombinant cell culture 

10 enables the production of the fusion protein free of proteins which may be nramally associated 
witii GM-CSF or IL-3 as they are found in nature in their respective species of origin, e.g., in 
cells, cell exudates or body fluids. 

Fusion protein compositions are prepared for administration by mixing fiision protein 
having die desired degree of purity with physiologically acceptable carriers. Such carriers will 

15 be nontoxic to recipients at the dosages and concentrations employed. Ordinarily, the 
preparation of such compositions entails combining the fusion protein with buffers, 
antioxidants such as ascorbic acid, low molecular weight (less tiian about ten residues) 
polypeptides, proteins, amino acids, carbohydrates including glucose, sucrose or dextrins, 
chelating agents such as EDTA, glutatfiione and other stabilizers and excipients. 

20 Fusion protein conpositions may be used to enhance proliferation, differentiation and 

functional activation of hematopoietic progenitor ceils, such as bone marrow ceUs. 
Specifically, compositions containing the fusion protein may be used to increase peripheral 
blood leukocyte numbers and increase circulating granulocyte counts in myelosuppressed 
patients. To achieve this result, a therapeutically effective quantity of a fusion protein 

25 composition is administered to a mammal, preferably a human, in association witii a 
pharmaceutical carrier or diluent 

The following examples are offered by way of illustration, and not by way of 
limitation. 
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Example 1 

Synthesis of cDNAs Encoding GM-CSF/IL-3 Fusion Protein 

5 

A. Isolation of cDNA encoding huTL-3 . Peripheral blood lymphocytes were isolated 
from buffy coats prepared from whole blood (Portland Red Cross, Portland, Oregon, USA) by 
FicoU hypaque density centrifugation. T cells were isolated by rosetting with 2-amino- 
ethylthiouronium bromide-treated sheep red blood cells. Cells were cultured in 175 cn^ flasks 

10 at 5 X 106 cells/ml for 18 hour in 100 ml RPMI, 10% fetal calf serum, 50 jiM b- 
mercaptoethanol, 1% phytohemagglutinin (PHA) and 10 ng/ml phorbol 12-myristate 13-acetate 
(PMA). RNA was extracted by the guanidinium CsCl method and poly A+ RNA prepared by 
oligo-dT cellulose chromatography (Maniatis et al.. Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor, 1982). cDNA was prepared from poly A"*" RNA essentially as described 

15 by Gubler and Hoffman, Gene 25:263-269 (1983). The cDNA was rendered double-stranded 
using DNA polymerase I, blunt-ended with T4 DNA polymerase, methylated with EcoRl 
methylase to protect EcoRl cleavage sites within the cDNA, and ligated to EcoRl linkers. 
These constructs were digested with EcoRl to remove all but one copy of the linkers at each 
end of the cDNA, ligated to EcoRl-cut and dephosphorylated arms of phage XgtlO (Huynh et 

20 al., DNA Cloning: A Practical Approach, Glover, ed., IRL Press, pp. 49-78) and packaged 
into X phage extracts (Stratagene, San Diego, CA, USA) according to the manufacturef s 
instructions. 500,000 recombinants were plated on £. coli strain C600hfl- and screened by 
standard plaque hybridization techniques using the following probes. 

Two oligonucleotides were synthesized, with sequences complementary to selected 5' 

25 and 3' sequences of the huIL-3 gene. The 5* probe, complementary to a sequence encoding 
part of the huIL-3 leader, had the sequence 5'-GAGTTGGAGCAGGAGCAGGAC-3'. The 3' 
probe, corresponding to a region encoding amino acids 123-130 of the mature protein, had the 
sequence 5 -GATCGCGAGGCTCAAAGTCGT-3'. The method of synthesis was a standard 
automated triester method substantially similar to that disclosed by Sood et al., NucL Acids 

30 Res. 4:2557 (1977) and Hirose et al.. Tet. Lett. 28:2449 (1978). Following synthesis, 
oligonucleotides were deblocked and purified by preparative gel electrophoresis. For use as 
screening probes, the oligonucleotides were terminally radiolabeled with 32p-ATP and T4 
polynucleotide kinase using techniques similar to those disclosed by Maniatis et al. The £, coli 
strain used for library screening was C600hfl- (Huynh et al., 1985, supra). 

35 Thirteen positive plaques were purified and re-probed separately with the two 

hybridization probes. Eleven clones hybridized to both oligonucleotides. The cDNA inserts 
from several positive recombinant phage were subcloned into an EcoRl-cut derivative 
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(pGEMBLlS) of the standard cloning vector pBR322 containing a polylinker having a unique 
EcoRl site, a BamHl site and numerous other unique restriction sites. An exenq}laiy vector of 
this type, pGEMBL, is described by Dente et al., Nucl. Acids Res. 77:1645 (1983), in which 
the promoters for SP6 and T7 polymerases flank the multiple cloning sites. The nucleotide 
5 sequences of selected clones were determined by the chain termination method. Specifically, 
partial EcoRl digestion of X,GT10:IL-3 clones 2, 3, 4 and 5 yielded fragments ranging fiom 
850 bp to 1,000 bp in size which were separately subcloned into the EcoRl site of 
pGEMBLlS. The inserts of rfie pGEMBL:ihuIL-3 subclones were sequenced using a 
universal primra- that binds adjacent to the multiple cloning site of pGEMBLlS, and syntiietic 
10 oligonucleotide primers derived from the huIL-3 sequence. 

B. Modification of N-GlvR nsvlation Sites Encoded hv huIL-3 cDNA and Assembly nf 
Expression Vector for rhuH..^ n>m8 Acpl5 Agp70^ The two asparagine-linked glycosylation 
sites present in tiie natural protein (Asnl5 and Asn^O) were altered by changing tiie codons at 

15 these positions to ones tiiat encode aspartic acid. This prevents N-linked glycosylation (often 
hyperglycosylation) of tiie secreted protein by the yeast cells, and a more homogeneous 
product is obtained. These changes were made as described below upon subcloning the huIL- 
3 cDNA into tiie yeast expression vector pIXY120. 

The yeast expression vector pIXY120 is substantially identical to pBC102-K22, 

20 described in EPA 243,153, except Uiat the following syntiietic oligonucleotide containing 
multiple cloning sites was inserted from the Asp718 site (amino acid 79) near tiie 3' end of die 
a-factor signal peptide to the Spel site contained in the 2ti sequences: 

Asp718 /Ncol 
GTACCTTTGGATAAAAGAGaCTACaAGGACGRCGATGACAAGAGGCCTCCATGGATCCCCCGGGACA 

GAAACCTATTTTCTCTGATGTTCCTGCTGCTACTGTTCTCCGGAGGTACCTAGGGGGCCCTGTGATC 

/BamHl Spel 

In addition, a 514-bp DNA fragment derived from die single-stranded bacteriophage fl 
30 containing tiie origin of replication and intergenic region was inserted at die Nrul site in tiie 
pBR322 DNA sequences. The presence of tiie fl origin of replication enables generation of 
single-stranded copies of tiie vector when transfonned into appropriate (male) strains of E. coli 
and superinfected with bacteriophage f 1. This capabUity facilitates DNA sequencing of tiie 
vectOT and allows the possibility of in vitro mutagenesis. 
35 The yeast expression vector pIXY120 was digested witii tiie restriction enzymes 

Asp718, which cleaves near tiie 3' end of die a-factor leader peptide (nucleotide 237), and 
BamHl, which cleaves in die polylinker. The large vector fragment was purified and ligated to 
tiie following DNA fragments: (1) a huIL-3 cDNA fragment derived from plasmid 
GEMBL18:huIL-3 from tiie Clal site (nucleotide 58 of mature huIL-3) to die BamHl site (3' 
40 to tiie huIL-3 cDNA in a polylinker); and (2) tiie following syntiietic oligonucleotide linker A: 
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GTA CCT TTG GAT AAA AGA GAG TAG AAG GAG GAG GAT GAG AAG GCT CCC ATG ACC GAG 

GA AAG GTA TTT TCT GTG ATG TTG CTG CTG GTA CT(3 TTC CGA GGG TAG TGG GTC 

5 AC£ ACG CCC TTG AAG ACQ AGC TGG GTT'fiAX TGC TCT AAC ATG AT 

TGC TGC GGG AAC TTC TGG TGG ACC CAA GTA ACG AGA TTG TAG TAG C 

Oligonucleotide A regenerates the sequence encoding the C-terminus of the a-factor leader 
peptide and fusing it in-ftame to the octapeptide DYKDDDDK, which is, in turn, fused to the 

10 N-terminus of mature rhuIL-3. This fusion to the rhuIL-3 protein allows detection with 
antibody specific for the octapeptide and was used initially for monitoring the expression and 
purification of rhuIL-S, This oligonucleotide also encodes an amino acid change at position 15 
(Asnl5 to Aspl5) 10 alter this N-linked glycosylation site. The underlined nucleotides in 
oligonucleotide A represent changes from the wild type cDNA sequence. Only the A to G and 

15 C to T changes at nucleotides 43 and 45, respectively (counting from the codon coiresponding 
to the N-terminal alanine of the mature huIL-3 molecule), result in an amino acid change 
(Aspl5). The other base changes introduce convenient restriction sites (Ahall and PvuU) 
without altering the amino acid sequence. The resulting plasmid was designated pIXY139 and 
contains a rhuIL-3 cDNA with one remaining N-linked glycosylation consensus sequence 

20 (Asn70). 

Plasmid pIXY139 was used to perform oligonucleotide-directed mutagenesis to remove 
the second N-linked glycosylation consensus sequence by changing Asn'^^ to Asp*^^. The in 
vitro mutagenesis was conducted by a method similar to that described by Walder and Walder, 
Gene 42:133 (1986). The yeast vector, pIXY139, contains the origirr of replication for the 

25 single-stranded bacteriophage fl and is capable of generating single-stranded DNA when 
present in a suitable (male) strain of E, coli and superinfected with helper phage. 

Single-stranded DNA was generated by transforming £. coli strain JM107 and 
superinfecting with helper phage IRl. Single-stranded DNA was isolated and annealed to the 
following mutagenic oligonucleotide B, GTC AAG AGT TTA CAG fiAC GC/< TCA GCA 

30 AAT G, which provides a codon switch substituting Asp for Asn at position 70 of mature 
huIL-3. Annealing and yeast transforaiation conditions were done as described by Walder and 
Walder, supra. Yeast transfonnants were selected by growth on medium lacking tryptophan, 
pooled, and DNA extracted as described by Holm et al.. Gene 42:169 (1986). This DNA, 
containing a mixture of wild type and mutant plasmid DNA, was used to transform E. coli RRl 

35 to ampicillin resistance. The resulting colonies were screened by hybridization to radiolabeled 
oligonucleotide B using standard techniques. Plasmids comprising DNA encoding huIL-3 
Asp^^ were identified by the hybridization to radiolabeled oligonucleotide B under stringent 
conditions and verified by nucleotide sequencing. 
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The resulting yeast expression plasmid was designated pIXY138, and contained die 
huIL-3 gene encoding the Aspl5 Asp^O amino acid changes and the octapeptide DYKDDDDK 
at the N-tenninus. The final yeast expression plasmid is identical to pIXY138 except that it 
lacks the nucleotide sequences coding for the octapeptide, tiius generating mature rhuIL-3 as 
5 the product 

The final yeast expression plasmid was constructed as described below. The yeast 
expression vector pIXY120 was cleaved witii die restriction enzymes Asp718 and BamHl as 
described above. The large vector fragment was ligated togetiier with (1) a huIL-3 cDNA 
fragment derived fiiom plasmid pIXY138 tiiat extended from the Aha2 site (which cleaves a 
10 nucleotide 19 of mature huIL-3) to die BamHl site 3' to the cDNA, and (2) die following 
synthetic oligonucleotide C: 



GTA CCT TTG GAT AAA AGA GCT CCC ATG ACC CAG ACG A 

GA AAC CTA TTT TCT CGT GGG TAG TGG GTC TGC TGC 
Pro Leu Asp Lys Arg Ala Pro Met Thr Gin Thr Thr 



Oligonucleotide C regenerates the 3' end of the a-factor leader peptide from the Asp718 site 
(the amino acids Pro-Leu-Asp-Lys-Arg) and the N-terminal seven amino acids of huIL-3 to the 
Ahall site. The resulting plasmid was designated pIXYlSL This vector, when present in 
20 yeast, allows glucose-regulated expression and secretion of rhuIL-3 (Pro^ Asp^^ Asp'^O), 

C. Expression Vector for rhuGM-CSF (Leu2 3 , Asp2 7, Glu3 9) Containing Modified N- 
Glycosylarion Sitps. The wild-type gene coding for human GM-CSF, resident on plasmid 
pHG23, has been deposited with the American Type Culture Collection (ATCC), 12301 

25 Parklawn Drive, Rockville, Maryland 20852, USA, under accession number 39900. The 
wUd-type gene inserted in a yeast expression vector, pYafHuGM, has also been deposited 
with die ATCC under accession number 53157. In order to provide a non-glycosylated analog 
of human GM-CSF, oligonucleotide-directed site-specific mutagenesis procedures were 
employed to eliminate potential N-glycosylation sites, as described in PCT publication WO 

30 89/03881. A plasmid encoding this analog, huGM-CSF (Leu^^ Asp27 Glu^^), was deposited 
witfi die ATCC as plasmid L207-3 in £. coli strain RRl under accession number 67231. 



D. Construction of Express ion Vector for GM-CSF/IL-3 Fusion Protein . In older to 
create a secretion vector for expressing a fusion construct having human GM-CSF and human 
35 IL-3 separated by a linker sequence, a precursor plasmid was first constructed by directiy 
fusing DNAs encoding GM-CSF and IL-3 togetfier without regard to reading fi-ame or 
intervening sequences. A cDNA fragment encoding nonglycosylated human GM-CSF was 
excised from plasmid L207-3 as a 977bp restriction fragment (Sphl to Sspl). The IL-3 cDNA 
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was excised from pIXYlSl by digestion with Asp718, which was then blunt ended using the 
T4 polymerase reaction of Maniatas et al. (Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor, 1982, p. 118) and further digested with Xhol giving an 803 bp fragment. 
These two fragments were then directly ligated to a pIXYlSl vector fragment cut with Sphl 
5 and Xhol, This plasnrid was called GM/IL-3 direct fusion. 

The GM/IL-3 direct fusion plasmid was used as a template in oligonucleotide-directed 
mutagenesis using methods similar to those described by Walder and Walder, sipra. The 
following oligonucleotide was then synthesized 

10 GCCAGTCCAGGAGGGTGGCGGTGGATCCGGCGGTGGTGGATCTGGTGGCGGCGGCTCAGCTCCCATGACCC 

ProValGlnGluGlyGlyGlyGlySerGlyGlyGlyGlySerGlyGlyGlyGlySerAlaProMetThr 
GM-CSF >< Linker X il-3 

15 This oligonucleotide overlaps the 3' end of GM-CSF by 13 bp but does not include the stop 
codon, contains the Gly Ser linker, and overlaps the 5' end of IL-3 by 13 bp. The linker 
sequence was a modified version of the linker described by Huston et al. (Proc. Natl Acad. 
ScL USA 85:5879-5883, 1988) but was optimized for codon usage in yeast as per Bennetzen 
et al. (/, BioL Chem. 257:3026, 1982). 

20 Single stranded plasmid DNA was made from the GM/IL-3 direct fusion using R408 

helper phage (Stratagene) and the methods of Russel et al, (Gene 45:333-338, 1986). 
Oligonucleotide directed mutagenesis was then carried out by annealing the above 
oligonucleotide to the single stranded plasmid DNA and transforming yeast strain XV2181 
with annealed DNA as described by Walder and Walder, supra. The yeast vector contains the 

25 origin of replication for the single stranded bacteriophage f 1 and is capable of sponsoring 
single stranded DNA production when present in a suitable (male) strain of £. coU and 
superinfected with helper phage. Yeast transformants were selected by growth on medium 
lacking tryptophan, pooled, and DNA was exo^cted as described by Holm et al. (Gene 42:169, 
1986). This DNA, containing a mixture of mutant and wild type plasmid DNA, was used to 

30 transform £. coli RRl to ampicillin resistance. The resulting colonies were screened by 
hybridization to radiolabeled oligonucleotide using standard techniques. Plasmids comprising 
DNA encoding GM-CSF/linker/IL-3 were identified by their hybridization to radiolabeled 
oligonucleotide containing the linker under stringent conditions and verified by nucleotide 
sequencing. 

35 During nucleotide sequencing it was discovered that a mutation had occurred within the 

linker region. The nucleotide sequence TGGTGGATCTGG was deleted (see sequence), 
resulting in the expression of a protein in which the sequence of amino acids GlyGlySerGly 
were deleted. This mutation did not change the reading frame or prevent expression of a 
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biologically active protein. The resulting plasmid was designated pr^321 and expressed the 
fusion protein huGM-CSF[Leu23Asp27Glu39]/Gly4Sei<}ly5Ser/hun iPix)8Aspl5Asp70]. 

E. Expression a nd Purificarion of GM-CSF/IL-3 Fusion Protein 
5 The host strain, XV2181, a diploid 5. cerevisiae strain, was formed by mating XV617- 

1-3B [a, his6, leu2-l, trpl-l, ura 3, steS], obtained from the University of Washington, 
Department of Genetics Yeast Strain Bank, Seattle, WA, USA, and X2181-lB-[a, trpl-l, gall, 
adel, his2], obtained from the Yeast Genetic Stock Center, University of California, Berkeley, 
CA, USA. The host strain was transformed with the expression plasmid by the method of 

10 Shennan et al.. Laboratory Course Manual for Methods in Yeast Genetics, Cold Spring 
Harbor Laboratory, 1986. 

Yeast containing the expression plasmid pIXY321 (see ID, above) was maintained on 
YNB-trp agar plates stored at 4°C A preculture was started by inoculating several isolated 
recombinant yeast colonies into one liter of YNB-trp medium (6.7 g/L Yeast Nitrogen Base, 5 

15 g/L casamino acids, 40 mg/L adenine, 160 mg/L uracil, and 200 mg/L tyrosine), and was 
grown overnight in two 2-liter flasks at 30°C with vigorous shaking. By morning the culture 
was saturated, in stationary phase, at an ODeoo of 2 to 7. The fennenters (three machines of 
10 liter working volume), previously cleaned and sterilized, were filled to 80% of their 
working capacity with SD-2 medium (4.0 g/L ammonium sulfate, 3.2 g/L monobasic 

20 potassium phosphate, 3.0 g/L yeast extract, 1.0 g/L citric acid, 0.1 g/L sodium chloride, 5 
ml/L 2% calcium chloride, 2.5 ml/L vitamin 101 solution, 0.5 ml/L trace elements solution, 0.5 
ml/L 20% magnesium sulfate, 2.0 ml/L glucose) and maintained at 30°C with 500-600 rpm 
agitation and 10-16 1pm aeration. The inoculum was added. After two hours of growtii a 
nutrient feed of 50% glucose was begun at a rate such that 50 g/L is added over a period of 10- 

25 12 hours. The nutrient feed was tiien shifted to 50% ethanol added at 30-40 ml/hr until 
harvest 

Total elapsed time of fermentation was approximately 20 hours, after which optical 
density (600 nm) ranged finom 30 to 45. The fennenters were tiien cooled to 20''C, pH of die 
yeast beer was adjusted to 8,0 by tiie addition of 5 M NaOH, and die resulting material filtered 

30 tiffough a Millipore Pellicon filter system equipped with a 0.45 pm filter cassette, and collected 
in a sterile 10 L carboy. 

One liter of yeast supernatant containing GM-CSF/I1^3 fiision protein was concentrated 
to 50 ml on an Amicon YM-10 membrane. The yeast brotii concentrate was tiien further 
purified by preparative HPLC by applying to a 1 cm X 25 cm column packed witii 5|i C-18 

35 silica (Vydac, Separations Group, Hesperia, CA, USA) that was equilibrated in 0.1% 
trifluoroacetic acid in water (Solvent A) prior to application of the yeast concentrate. 
Alternatively, die crude yeast broth can be pumped directiy on to the C-18 column. Following 
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application of the material, the column was flushed with Solvent A until the optical absorbance 
of the effluent approached base line values. At this time a gradient of 0.1% trifluoroacetic acid 
in acetonitrile (Solvent B) was established from 0% B to 100% B at a rate of change of 1-2% B 
per nrinute and at a flow rate of 2 ml/minute. One minute fractions were collected Aliquots of 
5 the fractions were analyzed for protein content by dot blot witii a rabbit polyclonal antisera to 
IL-3. GM-CSF/IL-3 eluted in fraction 50 at approximately 50% acetonitrile. 

HPLC fractions which were positive for GM-CSF/IL-3 fusion protein by dot blot were 
pooled and bound to SP-Sepharose in 20 mM p-alanine, pH 4. Fusion protein was eluted 
witii 0.5 M NaCl, 100 mM Tris-HCl, pH 8. Fractions containing fusion protein were 

10 identified by SDS-PAGE. 

The ion exchange fractions containing protein having a molecular weight of 35,000 
were pooled, concentrated to 100 m1 and further purified by FPLC gel filtration on a Superose 
12 column. The column was eluted witfi PBS. Fractions containing only the purified 35,000 
MW fiision protein were identified by SDS-PAGE. 

15 The biological activities (units/mg) and binding affinities of the GM-CSF/IL-3 prepared 

substantially as described above were determined as set forth in Examples 4 and 5. 

Example 4 

Biological Activity of GM-CSF/IL-3 Fusion Protein in Thvmidine Incorporation Assav 

20 

In order to detennine its level of biological activity, the GM-CSF/IL-3 fusion protein 
prepared as described in Example 3 was assayed for ability to stimulate proliferation of AML- 
193 cells in a thymidine incorporation assay. The AML-193 cell line is a GM-CSF dependent 
human monocytic leukemia cell line originally described by Santoli et al. (/. ImmunoL 

25 759:3348, 1987). The cells were grown in Iscove's Modified Dulbecco's Media (IMDM) with 
25 mM HEPES, 200 nM L-glutamine, 5 ^ig/ml insulin, 5 jig/ml transferrin, 5 ng/ml sodium 
selenite, 2.5% heat inactivated fetal bovine serum, antibiotics, and 5 ng/ml of purified 
recombinant human GM-CSF. The cells were split twice weekly and were seeded into fresh 
medium at a density of 300,000/ml. 

30 A thyuMdine incorporation assay was employed to examine the capacity of known 

growth factors and unknown supematants to stimulate proliferation of AML-193. AML-193 
cells were washed by centrifugation and resuspended in assay medium composed of IMDM as 
above except that fetal calf serum and/or GM-CSF was not included. Pure GM-CSF, IL-3 or 
GM-CSF/IL-3 fiision protein was added to the first well of a 96 well flat bottom tissue culture 

35 plate at a final concentration of 400 ng/ml in 50 ml of medium. These samples were then 
serially diluted by 3-fold through the additional 1 1 wells of the microtitre plate. Fifty ^1 of 
medium containing 3750 AML-193 cells was added to each well and plates were incubated at 
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3TC for 138 hours in a fiilly humidified atmosphere of 6% CO2 in air. Tritiated thymidine 
(0.5 mO/well) was added to each well for an additional 6 hours of incubation and the samples 
were harvested with an automated sample harvester and counted by liquid scintillation. One 
unit of activity is defined as the amount of growth factor required to stimulate half-maximal 
5 thymidine incoiporation. 

Simultaneous titration of IL-3, GM-CSF or GM-CSF/IL-3 fusion protein at identical 
concentrations revealed that the fusion protein was a more potent proliferation stimulus than 
eitiier factor alone or IL-3 and GM-CSF combined. The specific activity of the IL-3, GM-CSF 
and GM-CSF/IL-3 fusion protein is set forth in Table A, below. 

10 

TABLE A 



Moigcwig Specific Acrjyjty 

15 IL-3 1.65 X 105 

GM-CSF 9,74 x 10^ 

IL-3 + GM-CSF 1.39 x 10^ 
GM-CSF/IL-3 L81 x 10^ 



20 

The specific activity of GM-CSF/IL-3 fusion protein is approximately 10-fold higher tiian IL-3 
or GM-CSF alone or GM-CSF plus IL-3 combined. 

Example 5 

25 Binding Activitv of GM- CSF/IL-3 Fusion Protein in EQuilibrium Binding Assav 

Binding affinities of human IL-3, GM-CSF and fusion protein for receptors on human 
cells lines were determined by inhibition of 125i.iabeled IL-3 or GM-CSF binding, 

A. Radiolabeling of GM-CSF and IL-3, Recombinant human GM-CSF/IL-3 fusion 

30 protein was expressed in yeast cells and purified substantially as described above. 
Recombinant human IL-3 and GM-CSF, engineered to contain tiie octapeptide D YKDDDDK 
were expressed in yeast and purified using a monoclonal antibody specific to the octapeptide 
substantially as described in Hopp et al. (Bio/Technology 6:1204, 1988). The purified GM- 
CSF and IL-3 proteins were radiolabeled using a commercially available enzymobead 

35 radioiodination reagent (BioRad), substantially as described by Park et al. (J. Biol, Chem. 
261:4177, 1986). Briefly, 2-10 jig of recombinant protein in 50 pi 0,2 M sodium phosphate, 
pH 7,2, was combined with 50 \d enzymobead reagent, 2 mCi of sodium iodide in 20 ^1 of 
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0.05 M sodium phosphate pH 7 and 10 ^1 of 2,5% p-D-glucose. After 10 ram at 25''C, sodium 
azide (10 ^1 of 50 mM) and sodium metabisulfite (10 of 5 mg/ml) were added sequentially 
and incubation continued for 5 minutes at 25^C. The reaction mixture was fractionated by gel 
filtration on a 2 ml bed volume of Sephadex G-25 (Sigma) equilibrated in Roswell Park 
5 Memorial Institute (RPMI) 1640 medium containing 2.5% (w/v) bovine serum albumin (BSA). 
0.2% (w/v) sodium azide and 20 mM Hepes, pH 7.4 (binding medium). The final pools of 
and I^Sj-GM-CSF were diluted to a working stock solution of 1 x 10^7 M in binding 
medium and stored for up to one month at 4°C without detectable loss of receptor binding 
activity. The specific activity of radiolabeled preparation of GM-CSF is routinely in the range 
10 of 1-5 X 10^5 q)in/mmole. The specific activity of IL-3 is in the range of 3-6 x 10*5 
cpm/mmole. 

B. Binding Assays. Binding assays were performed using JM-1, KG-1, HL-60 and 
AML-193 cells • JM-1, HL-60 and KG-1 cells were obtained and prepared as described by 
Park et al. (/. Biol. Chem. 264:5420, 1989). AML-193 cells were obtained and prepared as 

15 described above in Example 4. As described by Park et al. (supra), 125I-GM-CSF does not 
bind to JM-1 cells nor does GM-CSF inhibit binding of to JM-1 cells, indicating that 

these cells possess receptors capable of binding only IL-3. Conversely, does not 

bind to HL-60 cells nor does IL-3 inhibit binding of 125i.gm-CSF to HL-60 cells, indicating 
that these cells possess receptors capable of binding only GM-CSF, In contrast, both KG-1 

20 and AML-193 cells bind 125i.GM-CSF and and, in addition, botii IL-3 and GM-CSF 

are able to partially compete specific binding of the heterologous radiolabeled ligand, with 
approximately equivalent capacities. This suggests that these cell lines possess receptors tiiat 
bind only IL-3, receptors that bind only GM-CSF, and receptors tiiat bind both GM-CSF and 
IL-3, all with high affinity. 

25 In order to determine the affinity of binding (Kj) of IL-3, GM-CSF and GM-CSF/IL-3 

fusion protein, inhibition assays were performed in which the ability of varying concentrations 
of these unlabeled proteins to inhibit binding of 125I-IL-3 to JM-1 cells, 125i-gM-CSF to HI^ 
60 cells and 125I-IL-3 and 125i.gM-CSF to KG-1 and AMI^193 ceUs were measured. Assays 
were performed by incubating cells (33 x lO^/ml) with 3 x 10-10 M 125i-GM-CSF or 125i-IL- 

30 3 and varying concentrations of unlabeled IL-3, GM-CSF or GM-CSF/IL-3 fusion protein for 
30-60 minutes at 37^C. Binding was assayed using the phthalate oil separation method 
disclosed by Dower et al, (7. Immunol 752:751, 1984), essentially as described by Park et al. 
(/, Biol Chem 2<J7:4177, 1986). Data was analyzed as described by Park et al. {Blood 74:56, 
1989). Binding affinities were detemiined for IL-3, GM-CSF and GM-CSF/IL-3, as shown in 

35 Table B, below. 
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TABLE B 



Labeled Unlabeled K t Values m l) 

5 ijsaad CamcfitiiQi IMd HLfiQ Kfij aml-193 

125I.IL.3 IL-3 1.8xl0l0 - 2.0xI0l0 . 4.1xl09 

GM-CSF/IL-3 6.1x109 - 2.8xl0" LSxIQIO 

125KJM-CSF GM-CSF - 1.2xl0l0 3.2xl0l0 l.fixlolO 

10 GM-CSF/IL-3 ~ 6.8xl09 1.9xl0l0 l^SxlQlO 



The experiments used to obtain the data in Table B were conducted using different cell lines in 
different experiments and accordingly show some variation, making direct comparison 
15 difficult In order to enable direct comparison of this data, the Kj values of both the controls 
and die fusion proteins were normalized to the Ki value for the control on one cell line. IL-3 
data were normalized to a Ki=1.8xlOiO M"! on JM-1 cells, and GM-CSF data were 
nomialized to a Ki=1.2xlOlO M-1 on HL-60 cells to give the values as set forth below in Table 
C. 

20 

TABLE C 



Labeled Unlabeled Nonnalized K j Values (M: h 

L!£2B!fl Competitor IMJ HL60 ICQJ AML-19:; 

25 . 

125I.IL.3 IL-3 1.8xlOlO „ I.SxIQIO I.SxIQIO 

GM-CSF/IL-3 6.1xl09 - 2.5xlOll 6.8xl0l0 

125i<}M-CSF GM-CSF - 1.2xlOlO 1.2xl0l0 1.2xlOlO 

GM-CSF/IL-3 - 6.8xl09 T.lxlO^ l.lxlQlO 

30 



Comparison of die normalized data indicates that the GM-CSF/IL-3 fusion protein and GM- 
CSF bind with approximately die same affinity to recq)tors for GM-CSF on HL-60, KG-1 and 
AML-193 ceUs. In contrast, die GM-CSF/IL-3 fusion protein and IL-3 bind with different 
35 affinities: GM-CSF/IL-3 fusion protein binds widi lower affinity tiian IL-3 to receptors on 
JM-1 cells (which have only IL-3 binding receptors); in contrast, the GM-CSF/IL-3 fusion 
protein binds with a significandy higher affinity than IL-3 to receptors on KG-1 and AML-193 
cells (both of which have GM-CSF/IL-3 receptors). Using the JM-1 cell line as a standard for 
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normal binding affinity of the GM-CSF/IL-3 fusion protein to a receptor (i.e., for binding to a 
receptor which is capable of binding only a single ligand), the GM-CSF/IL-3 fusion protein 
binds to KG-1 cells with a 41.0-fold higher binding affinity, and to AML-193 cells with an 
1 1.1-fold higher binding affinity. 

5 Not wishing to be bound by any particular theory, it is believed that the higher binding 

affinity of GM-CSF/IL-3 fusion protein to KG-1 and AML-193 cells is related to the presence 
in both of these cell lines of the GM-CSF/IL-3 receptor. In particular, the higher binding 
affinity of the GM-CSF/IL-3 fusion protein to die AML-193 cell line may explain the higher 
biological activity of the GM-CSF/IL-3 fusion protein in the thymidine incorporation assay of 

10 Example 4 which utilized the AML-193 cell line. 

Example 6 

Effect of GM-CSF/IL-3 on Proliferation of Human Bone Marrow 

15 The biological effect of GM-CSF/IL-3 on the proliferation of unftactionated human 

bone marrow was compared with that of GM-CSF and IL-3 alone. Non-adherent, low 
density, T cell deplated cultures of human bone marrow were plated in methylcellulose (BFU- 
E, CFU-GEMM, 40,000 cells per plate) or agar (CFU-GM; 40,000 cells per culture) as 
described by Lu et al.. Blood 61:250 (1983). Methylcellulose cultures contained 1 unit per 

20 plate erythropoietin and accounts for the background of 48±2 BFU-E in the absence of 
cytokine. Cultures were incubated in a 5% O2, 5% CO2, 90% N2 atmosphere for 14 days and 
counted wifli an inverted microscope. These values represent the mean ±1 standard deviation 
of duplicate or triplicate data points in one of two representative experiments. 
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TABLE D 



Cytokine 



10 



15 



20 



25 



30 



35 



None 

GM-CSF/IL-3 



GM-CSFfE^3 



GM-CSF 



IL-3 



Colonies afean+Snt 



Dose (ry^fmV) 


CFU-GEMM 




■DCTT t; 

prU'P, 




ot 


^+lt 




sooo 


10.3+1 


97±2 


107±5 


2500 


10.3±0.9 


74±1 


119±5 


1250 


6.8±0.5 


59±2 




625 


43±0.5 


52±6 


83+5 


312 


2.5±0.7 


35±2 


70±4 


156 


1.0±0.4 


24±3 


49±6t 


78 


Ot 


16±2 


50±3t 


5000+5000 


10.0+0.7 


54±3 




2500+2500 


8.510.5 


52±3 




1250+1250 


5,0+0.4 


47+4 




625+625 


2.5±0.3 


27±2 




312+312 


i.oto 


14+1 




156±156t 


0.3±0.3 


1Q±1 


4413t 


5000 


6.8±0.6 


42±4 


6313 


2500 


3.5±0.7 


42±2 


6111 


1250 


1.510.3 


33±2 


4714t 


625 


0.310.3t 


21±2 


4613t 


312 


Ot 


16±1 


t 


5000 


4.010.7 


16+0.3 


6712 


2500 


4.010.4 


9±1 


6612 


1250 


1.310.3 


4±lt 


• 4912t 


625 


Ot 


t 


4513t 


312 


Ot 


t 


t 



t = value equal to media control 
* p<0.05 compared to media control 
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TABLE E 



Colonies fMean+S.D.> 

5 CytQlting Posg (pg/ml) cfu-gemm cfij-gm bfu-e 





Ncme 




2+Ot 


Ot 


44±4t 




GM-CSF/IL-3 


5000 




43±5 




10 




2500 


10.0±1* 


45±5 


125+10* 






1250 


9.ato* 


23±2 


127±7* 






625 


6.Q±l' 


15±2 


97±2* 






312 


5.510.5 


8±1 


71±1* 






156 


2.0±lt 


5±1 


47+3+ 


15 




78 


2.0±0t 


2±0.3 


44±1+ 




GM-CSFfIL-3 


5000+5000 


6.5±1J* 


32±2 


93±3* 






2500+2500 


5.C±1* 


21±2 


71±8* 






1250+1250 


1.5±0.5t 


13±0.3 


44±2t 


20 




625+625 


2.0±0t 


7±2 


45±5t 






312+312 


t 


3±1 


t 






156+156 


t 


l±0.9t 


t 



t = value equal to media control 
25 * p<0.05 compared to media control 

Tables D and E indicate that GM-CSF plus IL-3 is approximately two fold more potent than 
either GM-CSF or IL-3 alone in enhancing proliferation of unfractionated human bone marrow 
cells. The GM-CSF/IL-3 fusion protein is equivalent in potentcy to a mixture of GM-CSF and 
30 IL-3, and that the mixture of GM-CSF and IL-3 showed an approximately two fold 
enhancement compared to GM-CSF or IL-3 alone. 

Example 7 

gynthesig of cDNA Encoding IL-3/GM-CSF Fugipn Pro^gin 

35 

A cDNA encoding a fusion protein comprising an N-terminal IL-3 and a C-terminal 
GM-CSF was constructed as follows. The yeast expression vector pIXY120 (described in 
Example IB) was digested with the restriction enzymes Asp718, which cleaves near the 3' end 
of the a-factor leader peptide (nucleotide 237), and Ncol, which cleaves in the polylinker. The 

40 large vector fragment was purified and ligated to an approximately 500bp Asp718-Ncol 
fragment (encoding GM-CSF(Leu^Asp27Giu39)) from a partial digest of L207-3 (ATCC 
67231), to yield pIXY273. A 9kb Asp718-Bgl2 fragment of pIXY273 (still containing the 
GM-CSF(Leu^Asp^'^Glu39) cDNA) was then ligated to an Asp718-Nrul fragment encoding 
human IL-3 (Pro8Aspl5Asp70) from pIXY151 (described in Example IB) and the following 

45 double stranded oligonucloetide: 
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5 ' CGATCrTTGGTGGCGGTGGATCCGGCGGTGGTGGTGGATCTGGTGGCGGATCTGCTCCAGCTA 3 ' 
3' GCTAGAAACCACCGCCACCTAGGCCGCCACCACCACCTAGACCACCGCCTAGi.CGAGGTCGATCTAG 5' 
-IL-3~>< LINKER X GM-CSF 

5 This oligonucleotide overlaps the 3* end of IL-3 by 8bp but does not include the stop codon, 
contains the Gly-Ser linker, and overlaps the 5* end of GM-CSF by lObp. The resulting 
vector was termed pIXY344 and was used to express an IL-3/GM-CSF fusion protein 
essentially as described above in Example 3. 



10 



Example 8 

Binding Acrivitv of IL-3/GM-CSF Fusion Protein in Equilibriuni Binding Assav 



Binding affinities of human IL-3, GM-CSF and IL-3/GM-CSF fusion protein 
(produced as described in Example 8) for receptors on human cells lines were detennined by 
15 inhibition of l^Sj^abeled IL-3 or GM-CSF binding as described in Example 5 above. 

Binding assays were performed using JM-1, HL-60 and KG-1 cells, which were 
obtauied and prepared as described by Park et al. (/, BioL Chem. 264:5420, 1989). JM-1 
cells possess receptors capable of binding IL-3, but not GM-CSR Conversely, HL-60 cells 
possess receptors capable of binding GM-CSF, but not IL-3. KG-1 cells possess receptors for 
20 both GM-CSF and 11^3. 

Binding affinities (Ki)were detennined for IL-3, GM-CSF and GM-CSF/IL-3, as 
shown in Table F, below. 



25 



Labeled 
Li gand 



TABLE F 



Unlabeled 
Competitor 



JMiL 



Kt Values ^M' l^ 



HL6Q 



KG-1 



30 



35 



125i_ii^3 IL.3 

GM.CSF/IL-3 
IL-3/GM-CSF 

125i^M-CSF GM-CSF 

GM-CSF/IL-3 
IL-3/GM-CSF 



6.0x10^ 
2.5x10^ 
12x10^ 



LSxlQlO 

5.4xl09 

3.0x10^ 



5.7xl09 

3.0x10^0 

2.2xl0l0 

NJ). 

RD. 

NJ). 



ND. = no data available 
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The above data indicate that the GM-CSF/IL-3 and IL-3/GM-CSF fusion proteins bind to JM-1 
cells with an affinity lower than that of IL-3 alone. In contrast, the GM-CSF/IL-3 and IL- 
3/GM-CSF fusion proteins bind to KG-1 cells with a. significantly higher affinity than that of 
lL-3 alone. The Kj value for both GM-CSF/IL-3 and IL-3/GM-CSF fusion proteins on KG-1 
5 cells is 10-20 fold higher than on JM-1 cells. Similarly, the Ki values determined for GM- 
CSF/IL-3 and IL-3/GM-CSF on HL-60 ceUs are similar. 

In view of the data shown in Examples 4-6 (which show a correlation between binding 
affinity and enhanced biological activity), the above binding data suggest that the IL-3/GM- 
CSF fusion protein will have increased biological activity. 
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0520-WO Qaims 

1. A fusion protein having a formula selected from the group consisting of 
5 R1-R2, R2-R1, R1-L-R2 and R2-L-R1 

wherein Ri is GM-CSF; R2 is IL-3; and L is a linker peptide sequence. 

2. A fusion protein according to claim 1, wherein said linker sequence comprises 
10 amino acids selected from the group consisting of Gly, Asn, Ser, Thr and Ala, 

3. A fusion protein according to claim 2, wherein the length of said linker sequence is 
5 to 15 amino acids. 

15 4. A fusion protein according to claim 3 selected from the group consisting of amino 

acid residues 1-271 depicted in Figure 1 and amino acid residues 1-275 depicted in Figure 2. 

5. A fusion protein according to claim 3, wherein the fusion protein is huGM- 
CSFIlxu23Asp27Glu39]/Gly4SeiGly5Ser/huIL-3[Pro8Aspl5Asp70]. 

20 

6. A fusion protein according to claim 1, wherein the human GM-CSF is selected 
from the group consisting of huGM-CSF, huGM-CSF[Leu23Asp27Glu39], huGM- 
CSF[Leu23], huGM-CSF[Leu23Asp27], huGM-CSF[Glu39], huGM-CSF [Asp27Glu39], 
huGM-CSF[Leu23 Glu39] and huGM-CSF[Asp27] and human IL-3 is selected from the group 

25 consisting of huIL-3, huIL-3[Pro8Aspi5Asp70], huIL-3[Asp70], huIL-3[Aspl5Asp70], huIL- 
3[Pro8Aspl5], huIL-3[Pro8Asp70], and huIL-3[Aspl5]. 

7. A DNA sequence encoding the protein of claim 1 . 
30 8. A DNA sequence encoding the protein of claim 4. 

9. A DNA sequence encoding the protein of claim 5, 
10. A DNA sequence encoding the protein of claim 6. 

35 



11, A DNA sequence according to claim 1, which is degenerate as a result of the genetic 
code to the DNA sequences defined in Figure 1 or Figure 2. 
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12. A recombinant expression vector comprising a DNA sequence according to claim 7. 

13. A recombinant expression vector comprising a DNA sequence according to claim 8. 

5 

14. A recombinant expression vector comprising a DNA sequence according to claim 9. 

15. A recombinant expression vector comprising a DNA sequence according to claim 10. 
10 1 6. A recombinant expression vector comprising a DNA sequence according to claim 1 1 . 

17. A process for preparing a fusion protein comprising GM-CSF and IL-3, 
comprising the step of culturing a suitable host cell comprising a vector according to claim 12 
xmder conditions promoting expression. 

15 

18. A process for preparing a fusion protein comprising GM-CSF and IL-3, 
comprising the step of culturing a suitable host cell comprising a vector according to claim 13 
under conditions promoting expression. 

20 19. A process for preparing a fusion protein comprising GM-CSF and IL-3, 

comprising tiie step of culturing a suitable host cell comprising a vector according to claim 14 
under conditions promoting expression. 

20. A process for preparing a fusion protein comprising GM-CSF and IL-3, 
25 comprising the step of culturing a suitable host cell comprising a vector according to claim 15 

under conditions promoting expression. 

21. A process for preparing a fusion protein comprising GM-CSF and IL-3, 
comprising the step of culturing a suitable host cell comprising a vector according to claim 16 

30 under conditions promoting expression. 

22. A con5)osition for regulating immune or inflammatory responses in a mammal, 
comprising an effective amount of a fusion protein according to claim 1, and a suitable diluent 
or carrier. 

35 

23. The use of a fusion protein according to any of claims 1-6 in preparing a 
medicament for regulating immune responses in a human. 
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24. The use of a fusion protein according to any of claims 1-6 in preparing a 
pharmaceutical composition suitable for parenteral administration to a human patient for 
regulating immune responses. 

25. A pharmaceutical composition suitable for parenteral administration to a human 
patient for regulating immune responses, comprising an effective amount of a fusion protein 
according to claim 1 in admixture with a suitable diluent or carriCT. 
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FIGURE 1 



GCT CCA GCT AGA TCT CCA TCT CCA 
Ala Pro Ala Arg Ser Pro Ser Pro 

GTT AAC CTC ATT CAA GAA GCT TTG 
Val Asn Ala He Gin Glu Ala Leu 

GAC ACT GCT GCT GAG ATG AAT GAA 
Asp Thr Ala Ala Glu Met Asn Glu 

ATG TTT GAC CTC CAG GAG CCG ACC 
Met Phe Asp Leu Gin Glu Pro Thr 

CTG TAC AAG CAG GGC CTG CGG GGC 
Leu Tyr Lys Gin Gly Leu Arg Gly 

CCC TTG ACC ATG ATG GCC AGC CAC 
Pro Leu Thr Met Met Ala Ser His 

ACC CCG GAA ACT TCC TGT GCA ACC 
Thr Pro Glu Thr Ser Cys Ala Thr 

TTC AAA GAG AAC CTG AAG GAC TTT 
Phe Lys Glu Asn Leu Lys Asp Phe 

TGC TGG GAG CCA GTC CAG GAG ggt 
Cys Trp Glu Pro Val Gin Glu Gly 

ggc ggc tea GCT CCC ATG ACC CAG 
Gly Gly Ser Ala Pro Met Thr Gin 

TGG GTT GAT TGC TCT AAC ATG ATC 
Trp Val Asp Cys Ser Asn Met He 

AAG CAG CCA CCT TTG CCT TTG CTG 
Lys Gin Pro Pro Leu Pro Leu Leu 

GAA GAC CAA GAC ATT CTG ATG GAA 
Glu Asp Gin Asp He Leu Met Glu 

CTG GAG GCA TTC AAC AGG GCT GTC 
Leu Glu Ala Phe Asn Arg Ala Val 

GCA ATT GAG AGC ATT CTT AAA AAT 
Ala He Glu Ser He Leu Lys Asn 

GCC ACG GCC GCA CCC ACG CGA CAT 
Ala Thr Ala Ala Pro Thr Arg His 

GAC TGG AAT GAA TTC CGG AGG AAA 
Asp Trp Asn Glu Phe Arg Arg Lys 

CTT GAG AAT GCG CAG GCT CAA CAG 
Leu Glu Asn Ala Gin Ala Gin Gin 

TTT 
Phe 



TCT ACT CAA CCA TGG GAA CAC 45 

Ser Thr Gin Pro Trp Glu His 15 

CGT CTC CTG GAC CTG AGT AGA 90 

Arg Leu Leu Asp Leu Ser Arg 30 

GAA GTA GAA GTC ATC TCA GAA 135 

Glu Val Glu Val He Ser Glu 45 

TGC CTA CAG ACC CGC CTG GAG 180 

Cys Leu Gin Thr Arg Leu Glu 60 

AGC CTC ACC AAG CTC AAG GGC 225 

Ser Leu Thr Lys Leu Lys Gly 75 

TAC AAA CAG CAC TGC CCT CCA 270 

Tyr Lys Gin His Cys Pro Pro 90 

CAG ATT ATC ACC TTT GAA AGT 315 

Gin He He Thr Phe Glu Ser 105 

CTG CTT GTC ATC CCC TTT GAC 360 

Leu Leu Val He Pro Phe Asp 120 

ggc ggt gga tec ggc ggt ggc 405 

Gly Gly Gly Ser Gly Gly Gly 135 

ACG ACG CCC TTG AAG ACC AGC 450 

Thr Thr Pro Leu Lys Thr Ser 150 

GAT GAA ATT ATA AC A CAC TTA 495 

Asp Glu He He Thr His Leu 165 

GAC TTC AAC AAC CTC AAT GGG 540 

Asp Phe Asn Asn Leu. Asn Gly 180 

AAT AAC CTT CGA AGG CCA AAC 585 

Asn Asn Leu Arg Arg Pro Asn 195 

AAG AGT TTA CAG GAC GCA TCA 630 

Lys Ser Leu Gin Asp Ala Ser 210 

CTC CTG CCA TGT CTG CCC CTG 675 

Leu Leu Pro Cys Leu Pro Leu 225 

CCA ATC CAT ATC AAG GAC GGT 720 

Pro He His He Lys Asp Gly 240 

CTG ACG TTC TAT CTG AAA ACC 765 

Leu Thr Phe Tyr Leu Lys Thr 255 

ACG ACT TTG AGC CTC GCG ATC 810 

Thr Thr Leu Ser Leu Ala He 270 

813 
271 
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FIGURE 2 

GCT CCC ATG ACC CAG ACG ACG TCC TTG AAG ACC AGC TGG GTT GAT 45 

Ala Pro Met Thr Gin Thr Thr Ser Leu Lys Thr Ser Trp Val Asp 15 

TGC TCT AAC ATG ATC GAT GAA ATT ATA AC A CAC TTA AAG CAG CCA 90 

Cys Ser Asn Met He Asp Glu He He Thr His Leu Lys Gin Pro 30 



CCT 
Pro 


TTG 
Leu 


CCT 
Pro 


TTG 
Leu 


CTG 
Leu 


GAC 
Asp 


ATT 
He 


CTG 
Leu 


ATG 
Met 


GAA 
Glu 


TTC 
Phe 


AAC 
Asn 


AGG 
Arg 


GCT 
Ala 


GTC 
Val 


AGC 
Ser 


ATT 
He 


CTT 
Leu 


AAA 
Lys 


AAT 
Asn 



GAC TTC AAC AAC CTC AAT 
Asp Phe Asn Asn Leu Asn 

AAT AAC CTT 
Asn Asn Leu 

AAG AGT TTA 
Lys Ser Leu 

CTC CTG CCA TGT CTG CCC 
Leu Leu Pro Cys Leu Pro 



GGG GAA GAC CAA 135 
Gly Glu Asp Gin 45 



CTG GCC ACG GCC 270 
Leu Ala Thr Ala 90 



CGA AGG CCA AAC 
Arg Arg Pro Asn 

CAG GAC GCA TCA 
Gin Asp Ala Ser 



CTG GAG GCA 180 

Leu Glu Ala 60 

GCA ATT GAG 225 

Ala He Glu 75 



GCA CCC ACG CGA CAT CCA ATC CAT ATC AAG GAC GGT GAC TGG AAT 315 

Ala Pro Thr Arg His Pro He His He Lys Asp Gly Asp Trp Asn 105 

GAA TTC CGG AGG AAA CTG ACG TTC TAT CTG AAA ACC CTT GAG AAT 360 

Glu Phe Arg Arg Lys Leu Thr Phe Tyr Leu Lys Thr Leu Glu Asn 120 

GCG CAG GCT CAA CAG ACG ACT TTG AGC CTC GCG ATC TTT GGT GGC 405 

Ala Gin Ala Gin Gin Thr Thr Leu Ser Leu Ala He Phe Gly Gly 135 

GGT GGA TCC GGC GGT GGT GGA TCT GGT GGC GGC GGA TCT GCT CCA 450 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Pro 150 

GCT AGA TCT CCA TCT CCA TCT ACT CAA CCA TGG GAA CAC GTT AAC 4 95 

Ala Arg Ser Pro Ser Pro Ser Thr Gin Pro Trp Glu His Val Asn 165 

GTC ATT CAA GAA GCT TTG CGT CTC CTG GAC CTG AGT AGA GAC ACT 540 

Ala He Gin Glu Ala Leu Arg Leu Leu Asp Leu Ser Arg Asp Thr 180 

GCT GCT GAG ATG AAT GAA GAA GTA GAA GTC ATC TCA GAA ATG TTT 585 

Ala Ala Glu Met Asn Glu Glu Val Glu Val He Ser Glu Met Phe 195 

GAC CTC CAG GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG CTG TAC 630 

Asp Leu Gin Glu Pro Thr Cys Leu Gin Thr Arg Leu Glu Leu Tyr 210 

AAG CAG GGC CTG CGG GGC AGC CTC ACC AAG CTC AAG GGC CCC TTG 675 

Lys Gin Gly Leu Arg Gly Ser Leu Thr Lys Leu Lys Gly Pro Leu 225 

ACC ATG ATG GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA ACC CCG 720 

Thr Met Met Ala Ser His Tyr Lys Gin His Cys Pro Pro Thr Pro 240 

GAA ACT TCC TGT GCA ACC CAG ATT ATC ACC TTT GAA AGT TTC AAA 765 

Glu Thr Ser Cys Ala Thr Gin He He Thr Phe Glu Ser Phe Lys 255 

GAG AAC CTG AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC TGC TGG 810 

Glu Asn Leu Lys Asp Phe Leu Leu Val He Pro Phe Asp Cys Trp 270 



GAG CCA GTC CAG GAG 
Glu Pro Val Gin Glu 



825 
275 
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